A “DYNAMIC” PROOF OF THE FROBENIUS-PERRON
THEOREM FOR METZLER MATRICES

Kenneth J. Arrow
Stanford University


Matrices with non-negative off-diagonal elements have many applications
in mathematical economics and other fields of investigation. Economists
have called them “Metzler matrices” because of their study by L. Metzler
(1945). An important property, especially for the study of stability of
dynamic systems, is that the largest real part of the characteristic roots is
itself a characteristic root and has a semi-positive characteristic vector.

There is a less well-known property of linear dynamic systems governed
by Metzler matrices: if the forcing term is a non-negative vector and if
the system starts in the positive orthant, it will remain there forever. The
proposition seems to have been first proved by Samuel Karlin, though never
published by him; his proof is referred to by Beckenbach and Bellman
(1961), p. 137.

Karlin's result does not appear to be derivable from the standard
Frobenius-Perron theorem (Theorem 4 below). Its proof is not very hard,
however. The question is then raised, whether the Frobenius-Perron result
is derivable simply from Karlin's theorem. This note shows that the
answer is affirmative. The result may very possibly be useful for expository
purposes.

I start by demonstrating Karlin's theorem, for completeness. I then
derive several familiar properties of Metzler matrices from which the
Frobenius-Perron theorem can be derived.

DEF. 1: $A$ is a Metzler matrix if $a_{ij} \geq 0$ for all $i \neq j$.

There are several ways of proving Karlin's theorem. In the following, I
use the concept of the exponential of a matrix (for a more elementary but
lengthier proof, see Arrow (1960)).




DEF. 2:

$\exp A = \sum_{n=0}^{\infty} A^n / n!$.

The infinite series converges absolutely for any matrix $A$.

If $A$ and $B$ are matrices that commute (i.e., $AB = BA$), then it is easy
to see that the binomial theorem is valid for $(A + B)^n$. If we substitute in
the infinite series defining $\exp(A + B)$ and rearrange terms, it is easy to
see that

(1)  if $A$ and $B$ commute, then $\exp(A + B) = (\exp A)(\exp B)$.

If $A$ is non-negative, then of course $A^n \geq 0$ for all $n$. From Def. 2,

(2)  $\exp A \geq I$ if $A \geq 0$.

Now suppose $A$ is a Metzler matrix. By Def. 1, we can find a scalar
$s$ such that $A - sI \geq 0$. Clearly, $sI$ commutes with any matrix and in
particular with $A - sI$. Then,

$\exp A = \exp[sI + (A - sI)] = \exp(sI)\,\exp(A - sI)$.

But, from Def. 2, $\exp(sI) = e^s I$, since $I^n = I$ for all $n$.

(3)  $\exp A = \rho \exp(A - sI)$ for some scalar $\rho > 0$.

From (2), with $A$ replaced by $A - sI$, and (3),

(4)  if $A$ is Metzler, $\exp A \geq \rho I$ for some $\rho > 0$.

Therefore,

If $A$ is Metzler, $(\exp A)x \geq \rho x \gg 0$ if $x \gg 0$, and
$(\exp A)x \geq \rho x > 0$ if $x > 0$.

(I use the notations $x \gg 0$, $x > 0$, $x \geq 0$ to mean, respectively,
$x_i > 0$ for all $i$; $x \geq 0$ and $x_i > 0$ for some $i$; and
$x_i \geq 0$ for all $i$.)

LEMMA 1. If $A$ is Metzler, $(\exp A)x \gg 0$ if $x \gg 0$, and
$(\exp A)x > 0$ if $x > 0$.
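
Def. 2, statement (4), and Lemma 1 can be checked numerically on a small example. The following is a minimal sketch, assuming a truncated power series is accurate enough for a small matrix; the matrix `A`, the helper names, and the truncation at 60 terms are illustrative choices and not part of the original argument.

```python
import math

def mat_mul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def mat_exp(A, terms=60):
    """Truncated series sum_{n < terms} A^n / n!, as in Def. 2."""
    n = len(A)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]          # current term A^k / k!, starting at I
    for k in range(1, terms):
        term = [[v / k for v in row] for row in mat_mul(term, A)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result

def is_metzler(A):
    """Def. 1: all off-diagonal entries are non-negative."""
    n = len(A)
    return all(A[i][j] >= 0 for i in range(n) for j in range(n) if i != j)

# Hypothetical example matrix (Metzler, with negative diagonal).
A = [[-3.0, 1.0, 0.5],
     [0.0, -1.0, 2.0],
     [1.5, 0.0, -2.0]]
n = len(A)
s = min(A[i][i] for i in range(n))             # any s <= min a_ii gives A - sI >= 0
rho = math.exp(s)                              # exp(sI) = e^s I
shifted = [[A[i][j] - (s if i == j else 0.0) for j in range(n)] for i in range(n)]

E = mat_exp(A)
E_shift = mat_exp(shifted)

# Statement (3): exp A = rho * exp(A - sI), up to rounding.
factorization_ok = all(abs(E[i][j] - rho * E_shift[i][j]) < 1e-9
                       for i in range(n) for j in range(n))
# Statement (4): exp A >= rho * I entrywise.
bound_ok = all(E[i][j] >= rho * (1.0 if i == j else 0.0) - 1e-12
               for i in range(n) for j in range(n))
print(factorization_ok, bound_ok)
```

The shift by $s$ is what makes the argument work: the series for $\exp(A - sI)$ has only non-negative terms, so the entrywise bound is visible term by term, whereas the series for $\exp A$ itself has terms of mixed sign.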
Let $x(t)$ be the solution of the differential equation,

$\dot{x} = Ax + b$,

with given initial condition $x(0)$. Then $x(t)$ can be written,

$x(t) = (\exp At)\,x(0) + \int_0^t [\exp(Au)\,b]\,du$.

Suppose $A$ Metzler, $b \geq 0$, and $x(0) \gg 0$. From Def. 1, $At$ is Metzler
for $t \geq 0$. Then, from Lemma 1, $(\exp At)\,x(0) \gg 0$ and
$(\exp Au)\,b \geq 0$ for all $u \geq 0$, so that $x(t) \gg 0$.

THEOREM 1 (KARLIN). If $A$ is Metzler and $b \geq 0$ then any solution of
the differential equation,

$\dot{x} = Ax + b$,

for which $x(0) \gg 0$ has the property that $x(t) \gg 0$ for all $t \geq 0$.

Now suppose that $A$ is a stable Metzler matrix, so that the real parts of
all characteristic roots are negative. Take any vector $y$ for which $Ay \leq 0$.
Take any solution to the differential equation,

$\dot{x} = A(x - y)$,

for which $x(0) \gg 0$. Since $A$ is stable, $\lim_{t \to \infty} x(t) = y$.
Since $-Ay \geq 0$ and $A$ is Metzler, Theorem 1 implies that $x(t) \gg 0$ for
all $t \geq 0$. Therefore, $y \geq 0$.

THEOREM 2. If $A$ is a stable Metzler matrix and $Ay \leq 0$, then $y \geq 0$.

Suppose $Ay < 0$. Then clearly $y = 0$ is impossible.

COROLLARY 1. If $A$ is a stable Metzler matrix and $Ay < 0$, then $y > 0$.
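
The positivity in Theorem 1 and the limiting argument behind Theorem 2 can be illustrated by a simple simulation. This is a sketch under stated assumptions: forward Euler replaces the exact flow, the step `h` is chosen so that $I + hA \geq 0$ (which keeps every iterate positive), and the matrix, forcing vector, and starting point are hypothetical. With $b = -Ay$, the trajectory of $\dot{x} = Ax + b$ should stay in the positive orthant and converge to $y$.

```python
def euler_path(A, b, x0, h, steps):
    """Forward-Euler iterates of x' = Ax + b; returns every visited state."""
    n = len(x0)
    x = list(x0)
    path = [x]
    for _ in range(steps):
        # The comprehension reads only the old x, so the update is simultaneous.
        x = [x[i] + h * (sum(A[i][j] * x[j] for j in range(n)) + b[i])
             for i in range(n)]
        path.append(x)
    return path

# Hypothetical stable Metzler matrix: characteristic roots -1 and -5.
A = [[-2.0, 1.0],
     [3.0, -4.0]]
b = [1.0, 2.0]                       # non-negative forcing term, b = -Ay
path = euler_path(A, b, [0.05, 6.0], h=0.01, steps=4000)

min_component = min(min(x) for x in path)   # stays positive along the path
y = path[-1]                                # approximates the rest point
print(min_component, y)                     # y is close to (1.2, 1.4)
```

The rest point solves $Ay = -b$, which for this example is $y = (1.2, 1.4)$: a vector with $Ay < 0$ that is indeed positive, as Corollary 1 requires.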

Now a sort of converse of Theorem 2 can be shown. Suppose $A$ is a
Metzler matrix and $Ay \ll 0$ for some $y > 0$. Clearly, by perturbing $y$, it
can be assumed that,

(5)  $Ay \ll 0$, $y \gg 0$.
Let the superscript $T$ denote “transpose.” Let $x(t)$ satisfy,

(6)  $\dot{x} = A^T x$, $x(0) \gg 0$.

Then, by Theorem 1, $x(t) \gg 0$ for $t \geq 0$. Define,

(7)  $\xi(t) = y^T x(t)$.

Since $y \gg 0$ and $x(t) \gg 0$,

(8)  $\xi(t) > 0$ for $t \geq 0$.

From (5),

(9)  $Ay \leq my$ for some $m < 0$.

From (7) and (6),

$\dot{\xi} = y^T \dot{x} = y^T A^T x = (Ay)^T x \leq m y^T x = m\xi$.

By integration and (8),

$0 < \xi(t) \leq \xi(0)\,e^{mt}$ for all $t \geq 0$.

Hence, $\xi(t)$ must approach 0 as $t \to \infty$. Since $y \gg 0$ and
$x(t) \gg 0$ for all $t \geq 0$, this is possible only if $x(t)$ approaches 0.

Thus, every solution of the differential equation,

$\dot{x} = A^T x$,

for which $x(0) \gg 0$, approaches 0. But then every solution approaches 0 so
that $A^T$ and therefore $A$ is stable.

THEOREM 3. If $A$ is Metzler and $Ay \ll 0$ for some $y > 0$, then $A$ is
stable.
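
As a worked instance of the argument for Theorem 3, consider the following hypothetical $2 \times 2$ example (the particular matrix and vector are chosen only for illustration). It exhibits a constant $m < 0$ with $Ay \leq my$ and the resulting exponential bound on $\xi(t) = y^T x(t)$.

```latex
A = \begin{pmatrix} -2 & 1 \\ 3 & -4 \end{pmatrix}, \qquad
y = \begin{pmatrix} 1 \\ 1.2 \end{pmatrix} \gg 0, \qquad
Ay = \begin{pmatrix} -0.8 \\ -1.8 \end{pmatrix} \ll 0 .
% Ay \leq my holds with m = -0.8, since my = (-0.8, -0.96)^T.
\dot{\xi} = (Ay)^T x = -0.8\,x_1 - 1.8\,x_2
  \leq -0.8\,(x_1 + 1.2\,x_2) = -0.8\,\xi
  \quad \text{for } x \gg 0,
\qquad \text{so} \qquad
0 < \xi(t) \leq \xi(0)\,e^{-0.8 t} .
% Consistent with the theorem: the characteristic roots of A are -1 and -5.
```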

We can now deduce the existence of a dominant root and semi-positive
dominant vector from Corollary 1 and Theorem 3.

DEF. 3: $\sigma(A)$ = maximum of real parts of characteristic roots of $A$.

For any $s > \sigma(A)$, $A - sI$ is stable. Let $A$, in particular, be Metzler.
Then $A - sI$ is Metzler and stable. It is certainly non-singular. Choose a
fixed vector

(10)  $c \ll 0$.

For each $s > \sigma(A)$, we can define $y(s)$ as satisfying

(11)  $(A - sI)\,y(s) = c$.

Then by Corollary 1,

(12)  $y(s) > 0$ for all $s > \sigma(A)$.

Let $e$ be the vector all of whose components are 1. Then, $e^T y(s) > 0$.
Define

(13)  $\eta(s) = [e^T y(s)]^{-1}$, $x(s) = \eta(s)\,y(s)$,

so that

(14)  $x(s)$ belongs to the unit simplex, $S$, for all $s > \sigma(A)$.

From (11) and (13),

(15)  $(A - sI)\,x(s) = \eta(s)\,c$ for $s > \sigma(A)$.

From (14), the function $x(s)$ has a limit point, say $x$, as $s$ approaches
$\sigma(A) + 0$. Hence, $x \in S$, and, in particular,

(16)  $x > 0$.

By definition, there exists a sequence $\{s_n\}$ such that,

$\lim_{n \to \infty} s_n = \sigma(A) + 0$,

$\lim_{n \to \infty} x(s_n) = x$.

If we substitute $s_n$ for $s$ in (15) and let $n$ approach $\infty$, then the
left-hand side converges, and therefore the right-hand side must also
converge, so that,

$\lim_{n \to \infty} \eta(s_n) = \eta \geq 0$.

From (15), with $s$ replaced by $s_n$ and $n$ approaching infinity,

(17)  $[A - \sigma(A)I]\,x = \eta c$.

Suppose $\eta > 0$. Then $[A - \sigma(A)I]\,x \ll 0$. From (16) and Theorem 3,

$0 > \sigma[A - \sigma(A)I] = \sigma(A) - \sigma(A) = 0$,

a contradiction. Hence $\eta = 0$, so that, from (17),

$Ax = \sigma(A)\,x$.

THEOREM 4. If $A$ is a Metzler matrix, there exists a real number,
$\sigma$, and a real vector $x > 0$ such that, (a) $Ax = \sigma x$, and (b) for
every characteristic root, $\lambda$, of $A$, $R(\lambda) \leq \sigma$.
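
The limiting construction (10)-(17) can be traced numerically. The sketch below assumes that $\sigma(A)$ is already known for the example (here the roots are 0 and $-3$, so $\sigma(A) = 0$); the matrix, the vector `c`, and the helper names are hypothetical. It solves (11) for shrinking $s - \sigma(A)$, normalizes as in (13), and watches $x(s)$ settle on a semi-positive characteristic vector while $\eta(s)$ tends to 0.

```python
def solve(M, rhs):
    """Solve M z = rhs by Gaussian elimination with partial pivoting."""
    n = len(M)
    aug = [list(M[i]) + [rhs[i]] for i in range(n)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[piv] = aug[piv], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[r][k] -= f * aug[col][k]
    z = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][k] * z[k] for k in range(i + 1, n))
        z[i] = (aug[i][n] - tail) / aug[i][i]
    return z

def normalized_solution(A, c, s):
    """Return x(s) and eta(s) from (11) and (13) for a given s > sigma(A)."""
    n = len(A)
    shifted = [[A[i][j] - (s if i == j else 0.0) for j in range(n)]
               for i in range(n)]
    y = solve(shifted, c)            # (11): (A - sI) y(s) = c
    eta = 1.0 / sum(y)               # (13): eta(s) = [e^T y(s)]^{-1}
    return [eta * v for v in y], eta

A = [[-1.0, 2.0],
     [1.0, -2.0]]                    # characteristic roots 0 and -3
c = [-1.0, -1.0]                     # (10): c << 0
sigma = 0.0                          # assumed known for this example

for eps in (1.0, 1e-2, 1e-4, 1e-6):
    x, eta = normalized_solution(A, c, sigma + eps)
    print(eps, x, eta)

# Residual of Ax = sigma x at the last iterate.
residual = max(abs(sum(A[i][j] * x[j] for j in range(2)) - sigma * x[i])
               for i in range(2))
```

For this example the iterates approach $x = (2/3, 1/3)$, which satisfies $Ax = 0 = \sigma(A)\,x$. In practice one would not know $\sigma(A)$ in advance; the sketch only illustrates the limit used in the proof.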

The usual criteria for stability and for existence of non-negative solutions
to the equation $Ax = b$ for Metzler matrices are either contained in the
theorems already stated or can easily be deduced from them.
APPENDIX

As is well known, Theorem 4 can be strengthened in one way: conclusion
(b) can be replaced by,

(b')  for every characteristic root, $\lambda$, of $A$, with $\lambda \neq \sigma$,
$R(\lambda) < \sigma$.

In view of (b), this is equivalent to the statement,

(b'')  if $\sigma + i\tau$ ($\tau$ real) is a characteristic root of $A$, then
$\tau = 0$.

I now prove statement (b'') using Theorems 1 and 4. Without loss of
generality, it can be assumed that

(18)  $\sigma(A) = 0$.

Suppose (b'') false for some $A$. Then, under (18),

(19)  $i\tau$ is a characteristic root of $A$ for some $\tau \neq 0$.

If $A$ is a matrix of order $n$, let $S$ be any subset of the integers
$1, \ldots, n$, and $S'$ its complement.

DEF. 4. $A$ is decomposable if there exists a non-empty set $S$ with
non-empty complement such that $A_{SS'} = 0$. Otherwise $A$ is indecomposable.

It is easy to see that if (18) and (19) hold for some Metzler matrix,
they hold for some indecomposable Metzler matrix. By successive
decompositions, we can find a partition of the integers $1, \ldots, n$ into sets
$S(i)$ $(i = 1, \ldots, p)$ such that,

$A_{S(i)S(j)} = 0$ for $i < j$,

$A_{S(i)S(i)}$ is indecomposable for each $i$.

Then any characteristic root of $A$ is a characteristic root of $A_{S(i)S(i)}$
for some $i$, and conversely. From (19), $i\tau$ is a characteristic root of
$A_{S(i)S(i)}$ for some $i$. By Def. 3, $\sigma(A_{S(i)S(i)}) \geq 0$. But, by
Theorem 4, $\sigma(A_{S(i)S(i)})$ is a root of $A_{S(i)S(i)}$ and therefore is a
root of $A$, so that,

$\sigma(A_{S(i)S(i)}) \leq \sigma(A) = 0$,

and therefore $\sigma(A_{S(i)S(i)}) = 0$. Hence, (18) and (19) hold for
$A_{S(i)S(i)}$, which is indecomposable and Metzler, by definition, and we can
say that they hold for some matrix $A$ which also satisfies,

(20)  $A$ indecomposable.

DEF. 5. For any vector $x$, $Z(x) = \{i \mid x_i = 0\}$.

We abbreviate $Z(x)$ as $Z$, when the context makes it clear.

The interesting implication of (20) for the present purpose is,

LEMMA 2. If $A$ is indecomposable and Metzler, $x > 0$, and $Z(x)$ is
non-empty, then $(Ax)_Z \neq 0$.

PROOF: Since $x > 0$, $Z'$ is non-empty. By Def. 5, $x_Z = 0$, $x_{Z'} \gg 0$.
Hence,

$(Ax)_Z = A_{ZZ}\,x_Z + A_{ZZ'}\,x_{Z'} = A_{ZZ'}\,x_{Z'}$.

Since $A$ is Metzler, all elements of $A_{ZZ'}$ are non-negative. Since $A$ is
indecomposable, at least one element of $A_{ZZ'}$ is non-zero and therefore
positive. Since $x_{Z'} \gg 0$, it must be that $(Ax)_Z > 0$.
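
A small worked example may make the role of indecomposability in Lemma 2 concrete. The two matrices below are hypothetical illustrations: the first is indecomposable and the conclusion of the lemma holds; the second is decomposable (take $S = \{2\}$) and the conclusion fails for the same vector $x$.

```latex
% Indecomposable case: both off-diagonal entries are positive.
A = \begin{pmatrix} -1 & 1 \\ 2 & -2 \end{pmatrix}, \quad
x = \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \quad
Z(x) = \{2\}, \quad
(Ax)_Z = A_{ZZ'}\,x_{Z'} = 2 \cdot 1 = 2 > 0 .
% Decomposable case: with S = \{2\}, B_{SS'} = b_{21} = 0.
B = \begin{pmatrix} -1 & 1 \\ 0 & -2 \end{pmatrix}, \quad
(Bx)_Z = b_{21}\,x_1 = 0 .
```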

From (18) and Theorem 4, there exists $x^* > 0$ such that $Ax^* = 0$.
If $Z(x^*)$ were non-empty, then certainly $(Ax^*)_Z = 0$, in contradiction to
Lemma 2. Hence, $Z(x^*)$ is empty.

(21)  There exists $x^* \gg 0$ such that $Ax^* = 0$.

Let $u + iv$ ($u$, $v$ real) be a characteristic vector of $A$ corresponding to
the root $i\tau$. Then it is easy to see that $u$ is a vector satisfying,

(22)  $A^2 u = -\tau^2 u$, $u \neq 0$.

Then the solution of the differential equation,

(23)  $\dot{x} = Ax$,

with initial condition $x(0) = u$ is,

(24)  $x'(t) = u \cos \tau t + (1/\tau)\,Au \sin \tau t$.

If $u > 0$, then, by Theorem 1, $x'(t) \geq 0$ for all $t \geq 0$, and in
particular for $t = \pi/\tau$. But $x'(\pi/\tau) = -u$, so that $u = 0$, a
contradiction. Hence, it is impossible that $u > 0$. Since $-u$ also satisfies
(22), it cannot be that $-u > 0$.

(25)  If $u$ satisfies (22), then $u_i > 0$ for some $i$, $u_j < 0$ for some $j$.

For any $x^*$ satisfying (21), the solution to (23) with $x(0) = x^* + u$ is,

(26)  $x(t) = x^* + x'(t)$.

From (21) and (25), $x^*$ can be chosen so that $x^* + u \geq 0$, with
$(x^* + u)_i = 0$ for at least one $i$, so that $Z(x^* + u)$ is non-empty. Since
$u_j > 0$ for some $j$, by (25), $(x^* + u)_j > 0$ and therefore $x^* + u > 0$.
Then $x(t) \geq 0$ for all $t \geq 0$; but since $x'(t)$ is periodic, in fact,
$x(t) \geq 0$ for all $t$. If $i \in Z(x^* + u)$, then $x_i(0) = 0$,
$x_i(t) \geq 0$ for all $t$; therefore, $\dot{x}_i(0) = 0$ for all $i \in Z$. From
the differential equation (23), $[Ax(0)]_Z = 0$, in contradiction to Lemma 2.
Hence, the existence of a purely imaginary characteristic root leads to a
contradiction.

THEOREM. If $A$ is Metzler, then there exists a real number $\sigma$ and
a real vector $x > 0$ such that, (a) $Ax = \sigma x$, and (b) $R(\lambda) < \sigma$
if $\lambda$ is any characteristic root of $A$ with $\lambda \neq \sigma$.